A naive Bayes classifier on 1998 KDD Cup

نویسنده

  • Chris Fleizach
چکیده

The 1998 KDD Data cup provides a large dataset that has a number of features which can be learned to attempt to predict potential respondents to a mailing. It is our goal to show that the naive Bayes classifier may be accurate enough to successfully choose who will reply to the mailing. By using cross validation, we hope to establish a basis for the expected performance. We also analyze the space and time complexity of the classifier in order to compare with the theoretical thresholds of the naive Bayes algorithm.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Using Text Classification to Predict the Gene Knockout Behaviour of S. Cerevisiae

A naive Bayes classifier was used to analyze gene behavior based on text data and presented as an entry for the 2002 KDD Cup, a data mining exercise to predict the behavior of the yeast S. Cerevisiae. The solution presented was based on the multinomial event model for text classification(McCallum & Nigam 1998) with a feature selection mechanism added. Despite this simple model, performance clos...

متن کامل

A K-Means and Naive Bayes learning approach for better intrusion detection

Intrusion Detection Systems (IDS) have become an important building block of any sound defense network infrastructure. Malicious attacks have brought more adverse impacts on the networks than before, increasing the need for an effective approach to detect and identify such attacks more effectively. In this study two learning approaches, K-Means Clustering and Naïve Bayes classifier (KMNB) are u...

متن کامل

A New Approach for Text Documents Classification with Invasive Weed Optimization and Naive Bayes Classifier

With the fast increase of the documents, using Text Document Classification (TDC) methods has become a crucial matter. This paper presented a hybrid model of Invasive Weed Optimization (IWO) and Naive Bayes (NB) classifier (IWO-NB) for Feature Selection (FS) in order to reduce the big size of features space in TDC. TDC includes different actions such as text processing, feature extraction, form...

متن کامل

Network Intrusion Detection Using a Hidden Naïve Bayes Binary Classifier

Using data mining techniques in intrusion detection systems is common for the classification of the network events as either normal events or attack events. Naïve Bayes (NB) method is a simple, efficient and popular data mining method that is built on conditional independence of attributes assumption. Hidden Naïve Bayes (HNB) is an extended form of NB that keeps the NB's simplicity and efficien...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006